Analysis of a threshold model of social contagion on degree-correlated networks. 
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We analytically determine when a range of abstract social contagion models permit global spread- 
ing from a single seed on degree-correlated, undirected random networks. We deduce the expected 
size of the largest vulnerable component, a network's tinderbox-like critical mass, as well as the 
probability that infecting a randomly chosen individual seed will trigger global spreading. In the 
appropriate limits, our results naturally reduce to standard ones for models of disease spreading 
and to the condition for the existence of a giant component. Recent advances in the distributed, 
infinite seed case allow us to further determine the final size of global spreading events, when they 
occur. To provide support for our results, we derive exact expressions for key spreading quantities 
for a simple yet rich family of random networks with bimodal degree distributions. 
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Over the last decade, the study of real-world, complex 
networks has grown enormously, fueled in no small part 
by the advent of readily available, large-scale data sets for 
real systems 0, Q . Understanding the coupled dynam- 
ics of both the structural evolution of, and processes on, 
complex networks remains a fertile area of investigation. 
Of particular importance is the study of contagion, how 
entities spread through networked systems, as exempli- 
fied by the diffusion of practices, beliefs, ideas, and emo- 
tions in social networks [!, 0, |aL disease contagion in 
human and animal populations p, 0] , cascading failures 
in electrical systems |& K| , the global spread of computer 
viruses on the Web [Rj, LLI| , and the collapse of financial 
systems 

Our present interest lies in contagion processes where 
individuals adopt alternate behaviors through imitation 
of peers. We specifically investigate a threshold model 
of social contagion on random networks — first proposed 
and studied by Watts [l3| — with the added complication 
of arbitrary degree-degree correlations. The model's ori- 
gins lie in the seminal work of Schclling, who employed 
a threshold model for a population on a checkerboard to 
gain insight into residential segregation 0, [H[ • Gra- 
novetter [1 61 ] subsequently studied a mean-field, random 
mixing threshold model which can be seen as a natu- 
ral limiting case of the model we consider here. Both 
Schelling and Granovetter's work clearly showed that 
global uniformity should not be taken to mean that indi- 
viduals have strong or similar preferences, and that small 
changes in the distributions of individual preferences 
could lead to sharp transitions in the system's macro- 
scopic state. 

Placing the threshold model on standard random net- 
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works with arbitrary degree distributions gives rise to a 
number of novel behaviors not seen in the Granovetter 
model. For example, individuals are now distinguish- 
able, and a single node changing its state can lead to 
a complete transition in the entire system's state [llj]. 
Moreover, by greatly limiting nodes' knowledge of the 
complete network, behaviors that would immediately die 
out when nodes are aware of all other nodes' states may 
now spread globally. Balkanization enhances innovation. 

We consider binary systems with nodes being in one 
of two states, <7o and o\. Initially, all nodes are in state 
fT . Our immediate interest is in determining whether 
or not global spreading is possible when a single node is 
switched to state <j\. and how this condition is affected 
by altering the level of degree-degree correlations. We 
then further wish to know two key quantities: (1) the 
probability global spreading occurs (ftrig), and (2) when 
it does, to what fraction of the entire network (S). 

In finding the probability that global spreading takes 
off after switching a single node to state o\ , it is enough 
for us to view the problem as one of standard percola- 
tion [It], [U[ ■ (Determining the final extent of spreading 
requires a distinct approach but nevertheless still capi- 
talizes on the locally branching nature of random net- 
works [l9|, This observation follows from several 
well known aspects of random networks. 

First, infinite, sparse random networks are locally per- 
fectly branching networks, possessing only very long 
cycles when sufficiently connected. Thus, a node can 
switch to state o\ in the initial stages of a spreading pro- 
cess (started by a single seed) only if a single neighbor 
switches to o\ earlier on. For global spreading to occur, 
a network must have a percolatin g co mponent of these 
easily switched 'vulnerable' nodes (T^j . (By percolating 
component, we mean a connected sub-network containing 
a non-zero fraction of all nodes.) As the adoption of state 
0\ spreads through this percolating component of vulner- 
able nodes, non- vulnerable nodes requiring two or more 
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neighbors in state a\ may also begin to switch. Howev- 
er, we need focus only on vulnerable nodes to determine 
whether global spreading is possible or not. When a per- 
colating vulnerable component exists, it may be viewed 
as a network's critical mass, and one that is highly sus- 
ceptible since it possesses its own critical mass — exactly 
any one of its own members. 

Second, our analytic treatment via generating func- 
tions is limited to the description of finite network com- 
ponents, which indirectly allows us to describe some 
aspects of infinite components. Since finite components 
are pure branching structures (i.e., they contain no cycles 
of any length) , all nodes can only switch to o\ due to the 
conversion of a single neighbor, even in the long run. 

We structure the paper as follows. After defining the 
model fully in Section []] we detail a series of analytic 
results in Section [n] concerning the probability and size 
of macroscopic spreading events. We confirm, via further 
analysis and simulations, a number of our calculations for 
a simple network containing two kinds of nodes in Sec- 
tion lllH We offer some concluding remarks in Sect ion HVl 



I. MODEL DEFINITION 

Each node is initially assigned a 'response function' 
which we take here to be a step function. In effect, each 
node is given a fixed threshold r sampled from a distri- 
bution -Pthreshoid(i")- Node states update in synchrony 
at times t = 0, 1, 2, ...Each node observes the frac- 
tion of its neighbors in state <7i, and switches to <j\ if 
this fraction meets or exceeds its threshold, r. Once a 
node switches to state cri, it remains in state <j\ perma- 
nently (akin to the SI model for disease spreading [(|). 
Asynchronous updating gives the same results for mono- 
tonically increasing response functions. 

We define the structure of our networks through edge 
probabilities, following Newman [2l|, [2^|. In studying a 
range of structural aspects of degree-correlated networks 
as well as dynamic phenomena on them (especially con- 
tagion processes) , it is mathematically convenient to use 
ejk-, the probability that a randomly chosen edge connects 
a node with degree j + 1 to one with degree k + 1, rather 
than j and fc. The quantity ejk then refers directly to 
the number of other edges emanating from the nodes an 
edge connects. Normalization is uncomplicated, requir- 
ing that J2T=o Efclo e 3k = 1- 

While this definition of ejk is clear for directed net- 
works, we encounter some subtleties in attempting to 
accommodate both undirected and directed networks 
with a single notation (even though many expressions 
will turn out to be the same). For undirected networks, 
which are our primary focus here, we evenly divide the 
probability that a randomly selected edge (now direction- 
less) connects nodes with degree j + 1 and k + 1 between 
the quantities ejk and ey. The chance of a randomly 
selected edge connecting degree j + 1 and fc + 1 nodes is 
then ejk + ekj = 2ejk, and the matrix formed by the ejk 



is symmetric. We are thus effectively retaining a ghost of 
a directed network, as each link must have a designated 
first and second node. (For directed networks, ejk need 
of course not be symmetric, and other complications are 
possible concerning the correlations between an individ- 
ual node's in-degree and out-degree.) 

We also have the important quantity qk which is the 
probability that in randomly choosing an edge, and then 
randomly choosing one end of that edge, we arrive at a 
node of degree fc+1 (equivalently, the node has k emanat- 
ing edges). For undirected networks, we have the deriva- 
tion q k = 5Ej°lp e ifc + | Ejlq e fcj = J2T=o e Jk- ( Th c 
same end expression holds for directed networks, where 
we must follow the edge's direction.) 

The Pearson correlation coefficient r for degree pairs 
gives us a measure of assortativity, and is given by 

_^ oo oo 
a i j=0k=0 

where a 2 = J2k k 2 Qk ~ Efc kqk] 2 is the variance in the 
number of emanating edges from a node arrived at by 
a random edge. Note that the choice of ejk almost pre- 
scribes the form of the resulting network's degree distri- 
bution, pk- The one piece of information missing is the 
abundance of nodes with no connections, p , which we 
must define independently A link between the ejk andpt- 
for undirected networks follows from the observation that 
qk is readily determined from the degree distribution: a 
randomly chosen edge leads to a degree k + 1 node with 
probability q k = (k+l)p k +i / 'fc avg where fc avg = Y,T=o k Pk 
is the average degree. We therefore have the connection 
Ik = J2^=o e Jk = (k + l)pk+i/k mg . Now, in isolating the 
Pk, we do not need to know fc avg . It is enough to know 
Pk oc qkjik + 1) for k > 0, since we can find the nor- 
malization constant (which is in fact fc avg ) by requiring 

Finding the probability of triggering a global spreading 
event reduces to standard percolation once we find the 
probability that an individual node is vulnerable. We 
allow this probability to be a function of node degree 
fc, using the notation bki (more generally, we write the 
probability that a node of degree fc switches to state o\ 
given I contacts in state o\ as bki). We first determine 
whether nodes are vulnerable or not, removing them from 
the network in the latter case. Finally, if global spreading 
occurs on the resulting reduced network, global spreading 
must occur on the original network. 

II. ANALYTIC RESULTS 

Building on the work of Newman [2l[ , we find closed 
form expressions for several probability generating func- 
tions related to component sizes. The key probability we 
need to characterize is f n j, which is the probability that 
an edge emanating from a degree j + 1 node leads to a 
finite vulnerable subcomponent of size n = 0,1,2,.... 
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Writing the marginal generating function for / n j as where 



relationship: 



we have the following recursive 



F j (x- ) b 1 )=x°J2—(l-bk+i,i) 



(2) 



where j = 0, 1, 2, In both terms, we have the quantity 

e jk/lj which represents the normalized probability that 
an edge from a degree node j + 1 leads to a degree fc + 1 
node. For undirected networks, we obtain this probabil- 
ity as (e jk + e kj )/(J2T=o e rt + e *=i) = 2e jk/2q 3 = e jk /qj. 
The first term in Eq. records the probability that an 
edge from a degree j + 1 node leads immediately to a non- 
vulnerable node, and hence a vulnerable subcomponent 
of size 0. 

The second term involves a composition of generating 
functions. The generating function F k is the argument 
of 



— bk+\,ix" 

k =o 



(3) 



which is itself the generating function for the probability 
that an edge from a degree j+1 node leads to a vulnerable 
node with k emanating edges [HI . Finally, the x leading 
the second term in Eq. |2]) accounts for the k + 1 degree 
node itself. 

Our task is now to find critical points indicating the 
onset of a giant component as we vary network structure 
by altering the ej k and b k \. We compute the average 
size of a finite vulnerable component found by following 
an edge from a degree j + 1 node, given by 
Differentiating Eq. ©, setting x = 1, and substituting 
9j = Sfclo e rt, we have 



q j F({l;b 1 ) = Y,e j kb k +i,i+Y, ke jkbk+i,iFi(l;b 1 ). (4) 



(We have used the fact that i*fc(l;f>i) = 1 for networks 
without a giant component, and all components con- 
tribute to the generating function of f n .k-) Rearranging 
Eq. ([5]), we have a linear system 



^ (SjkQk - kb k+ i^e jk ) Ff.(l; h) = ^ e jkbk+i,i (5) 



fe=0 



which we write as 



fc=0 



A E ,F'(l;6i)=E& 1 



(6) 



1 J j + l.fe+1 



Sj k q k - kbk+i,iejk 



J Kl 



F'(l;h) =fZ(l;6i), 
J fe+i 



and 



fe+i 



bk+i.i 



(7) 

(8) 
(9) 
(10) 



A solution exists when A E ^ 



for j, k = 0, 1, 2, 3, 
is invertible, i.e., its determinant is non-zero. We then 
have 



F'(l;b 1 ) = A^Eb 1 . 



(11) 



Our condition for the onset of global spreading is there- 
fore 



A - 



0. 



(12) 



with A E g defined in Eq. ([7]). We note that for networks 
where one or more degrees are not present (which we will 
encounter in our later specific examples), we omit the 
rows and columns of the above matrices corresponding 
to those degrees. For simplicity, we present our general 
results for networks assuming all degrees are represented, 
observing that adjustments to specific sets of degrees are 
straightforward. 

For uncorrelated networks, upon substituting ej k = 
qjq k , the above collapses to the known condition 



bki(k 



kp k 



(13) 



k=0 



as expected [i3| . Moreover, F^(l;&i) is seen to be inde- 
pendent of k, since the first node of an edge is now unre- 
lated to the other node and hence also the subcomponent 
it leads to. We provide details for these calculations in 
the Appendix. 

Returning to the general case of degree-correlated net- 
works, we see that when b k \ = b, a constant for all k, 
we have a disease-like contagion process. Furthermore, 
when b = 1, all nodes are vulnerable, and we have found 
the condition for a giant component, equivalent to that 
obtained by Newman [2l|. (Note that when b k \ = b, 
we have E6i = bq since [E6i]j+i = J2T=o e jkb k +i.i = 

b Y,kLo e 3k = 6 ?J+lO 

We next consider two probability distributions pertain- 
ing to component size: (1) g n , the probability that a ran- 
domly chosen node belongs to a vulnerable component 
of size n, and (2) h n , the probability that a randomly 
chosen node belongs or is adjacent to a vulnerable com- 
ponent of size n. Knowing g n helps us find the size of the 
largest vulnerable component, whose presence or absence 
dictates whether or not global cascades are possible for 
infinite random networks. The second probability h n will 
aid us in determining the probability of triggering a glob- 
al cascade. The triggering node, which is exogenously 
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switched to state u\ may be either vulnerable and part 
of the largest vulnerable component, or non-vulnerable 
and connected to one or more nodes in the largest vul- 
nerable component. For the standard giant component 
case where bki = 1, wc have Qji — h n 5 otherwise, these 
distributions are likely distinct. 

As for the fk,m we find closed form expressions for 
the generating functions associated with g n and h n . The 
generating function for g n satisfies the following relation- 
ship: 

oo 

G(x;bi) = x°^p k (l - b k x) + x 1 p b m 

k=0 



x^pkbki Fk-i{x;bi) . (14) 
fc=i 



The x° term carries the probability that a randomly 
selected node will not be in state ai; the second term 
accounts for the randomly chosen node being vulnerable 
but having degree 0; and the third term again uses the 
composition rule for sums of random variables of random 
sizes [23|. The generating function J2kLiPkbkix k cor- 
responds to the probability distribution for a randomly 
chosen node to have degree k and be vulnerable. Note 
that the argument Fk—\ appears in the last term rather 
than Fk because Fk—i by definition corresponded to a 
degree k node. 

The generating function for the triggering distribution 
satisfies a simplified version of Eq. JTj 



(15) 



OO 

H{x\bi) = x 1 p + x^pk ^fc_i(x;6i) 



k=l 



For the triggering distribution, the first node is now 
always switched to state <?\, regardless of whether it is 
itself vulnerable or not. The x 1 term accounts for this 
initial node having degree and hence being unable to 
trigger any other node. If the initial node has at least one 
neighbor, then spreading may occur and we can make 
use of the basic degree generating function Y^k=iPkX k 
combined with the generating function Fk-i(x;b\) for 
finite vulnerable subcomponent size. As per our previ- 
ous examples, the preceding factor x in the second term 
accounts for the initial node. In effect, bki = 1 for the 
triggering node as it is always forced to be switched to 
state (Ti, and indeed setting bki = 1 in Eq. (TT4")) directly 
yields Eq. ([15]). 

Now, the fraction of nodes in the largest vulnerable 
component is given by SVuin = 1 — G(l; bi), since G(l; bi) 
can be seen as the probability that a random node is part 
of a finite vulnerable component (including one of size 0). 
Setting x = 1 in Eq. (Ti"4"|) , we have 



i fc 



S vu i a =1 - G(l; h) 

OO OO 

= ^Pkbki -Po&oi -^Pkbki F k -i(l;bi] 



k=0 



fe=l 



(16) 



In the same fashion as for S vu \ n , the probability of trig- 
gering a cascade can be determined using H(x; b\): 



Strig=i--ff(i;6i) 



=i - 



OO 

5> fc [F fc -i(i;&i) 



k=0 



(17) 



The size of the triggering component can also be obtained 
by first making the observation that an initially switched 
node of degree k triggers a cascade with probability 



'trig 



1 



F fc _i(l;6i) 



(18) 



This is the probability that at least one edge from a 
degree k node leads to a giant component of vulnera- 
ble nodes. We then have Strig = J2kLo Pfc^trig' which is 
in agreement with Eq. (fT7|) . 

Both S'vuin and Strig depend on i*fc(l; &i), for which we 
obtain a potentially infinite set of closed-form, coupled, 
nonlinear recursive expressions from Eq. 



F J (i;&i) = E^( 1 - 6 ^ 



to * 



'fc+i,i 



F fc (l;6i 



(19) 

where we have substituted qj = X)fcLo e i fe - Solving 
Eq. (|19|) will almost always involve numerical techniques 
(though in Section IlIIl we examine a case that has ana- 
lytic solutions). The uncorrelated, pure random network 
version for the giant component (bki = 1) may serve as 
some inspiration. There, we have no dependence on the 
degree of the initial node and the problem reduces to 
solving F(l,l) = ££L gfc[f(l,I)]*. Both F(l,f) = 
and F(l, 1) = 1 are solutions and any initial estimate in 
between leads to, upon iteration, an intermediate solu- 
tion, if one exists. Similarly, iteration of Eq. (|19[) should 
generally reach the appropriate fixed point solution. 

We can also and more easily determine the average 
size of all vulnerable components (applicable when no 
giant vulnerable component is present since we again use 
F k (l;h) = 1): 

G'(l,h) =p I b 1 + fc avg (Efe 1 ) T A E 1 - i (Eh), (20) 

where [p\k = Pk- If we allow spreading to start from any 
node by forcing the first node to be in state ai , then the 
average size is instead given by 



H'(l,h) 



; (Ef 



l T A E W E M- 



(21) 



We complete our analysis with a description of the 
expected final size of a global spreading event, when it 
occurs. We use the results of recent work by Gleeson and 
Calahane [l^|, and subsequently Gleeson [20], who have 
explicated an elegant, general solution to the distributed, 
infinite seed case for a variety of spreading models on a 
wide range of random networks. 
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Their key observation is that a node i (degree kt) 
switching to state 0\ at time step n can only be due to 
previous switching in nodes within n steps of i (includ- 
ing, trivially, i itself). Furthermore, this neighborhood 
oii must be a pure branching network, i.e., a tree rooted 
at i. At time step n = 1, only i's immediate neighbors 
influence i, and these are in state o\ with probability 4>o- 
We write the probability that one of i's edges connects 
to a node in state o\ at time step n+ 1 as Ok it n- At time 
step n = 2, the effect of next neighbors is felt, and i's 
degree k neighbors are now in state u\ with probabili- 
ty &k,i which in turn depends generally on 9k', a = 4>o, 
k' > 1. Allowing n to increase, we are lead to a recursive 
expression for the 9k,n as follows. The expected size of a 
global spreading event given a fraction 4>o of nodes active 
at time t = is <f>oo , which is obtained in the n — > oo limit 
of the following equations [2(| : 



-l = + (i - <t>o)^2pk ^2 



where 



9j,n+l = <j>0 + (1 
fe=l 



E 



E 

i=0 



fc=0 i=0 



0o)x 

fe - 1 



1 k,nV L ~ Vk,n) 



\k—i i 



(22) 



(23) 



k-l-i 



bki 



and, again, 6k.o = 0o an d bki is the probability that 
a degree fc node switches to state a\ if exactly i of its 
neighbors have switched. 

We thus observe a symmetry between the explana- 
tions for when global spreading may occur from a single 
seed and if so, to what extent. The former concerns the 
progress of a spreading event as it moves outward from 
a single seed through a random branching network, and 
the latter hinges on how switching converges on a cen- 
tral node, again traversing a branching network, but in 
the reverse direction (this process is similarly described 
in [24j for a particular disease spreading model involving 
repeated contacts). 

Note that <p n must always increase or stay the same, 
since nodes never switch off, and must therefore approach 
a limit as n —> oo. This implies that node i feels the effect 
of initial activations within a finite number of steps, and 
indeed we see that the approach to 4>oo is rapid. Whether 
or not a small seed takes off can be determined from 
Eq. (|23|) by examining the matrix d8j, n+ i/d9k, n evaluat- 
ed at Ok.n = (20| . and yields a condition equivalent to 
our requirement |A| = (Eq. ifTS])). 

While these equations are derived with the assumption 
of an infinite seed (if an arbitrarily small one, fraction- 
wise), they nevertheless perform extremely well in the 
limit <fio — » 0. Even so, they cannot capture the salience 
of a global vulnerable component for the single seed case, 
which must be present for spreading to succeed. By com- 
parison, for the infinite seed model, spreading always 
occurs if the cascade condition is met. However, as noted 



in [20l | . by separately determining Strig, we can account 
for the major features of the single seed model: the prob- 
ability of generating a global spreading event, Strig, and 
its expected size (poo . We note that in [20j , an expression 
for Strig for uncorrelated random networks was obtained 
using Ooa] here, we have generalized St r i g to uncorrelated 
networks directly from the standard generating function 
approach. 



III. APPLICATION TO A SIMPLE FAMILY OF 
RANDOM NETWORKS 

For a tractable test case, we examine a family of infinite 
random networks with nodes having either of two degrees 
k a and fcf, where k a < k^. We consider the general class 
of threshold profiles b\ such that degree k a nodes are 
vulnerable and degree kb nodes are not. We are able 
to obtain expressions for the size of the vulnerable and 
triggering components as a function of assortativity, and 
we make comparisons with simulations for the (fc Q , kb) = 
(3, 4) case. 

We wish to consider the full range of assortativity 
and this constrains the relative probabilities of the two 
degrees in the following way. We specifically need 



h 

Pk a = TrPk b , 



(24) 



since when r = —1, degree k a nodes connect only to 
degree kb nodes, and so there must be relatively fewer of 
the latter. We therefore have 



kn 



Pk a 



Pk„ 



and pk = otherwise. 



(25) 

Values of r other than -1 add no further constraints. The 
corresponding E matrix is independent of k a and kb and 
has the form 



f 

E = - 

4 



1 + r 1 — r 
1 — r I + r 



(26) 



where we have left out irrelevant rows and columns of 
zeros. The condition for a giant component given by 
Eqs. O and (JT^J) is then 







l A E,Sj 



l-i(k a -l){l + r) 
i(fc -l)(r-l) 1 

|_i(ft -l)(l + r). 



(27) 



We therefore have a phase transition when 

2 

r = -1 



k a -l 



(28) 



with a giant component of vulnerable nodes arising and 
then growing as assortativity increases. (Note that when 
no giant component exists, |A| is positive.) For the case 
k a = 3, the phase transition occurs at r — 0, and as k a 
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increases, the location of the phase transition moves to 
more negative values of r. 

We next determine the size of the vulnerable and trig- 
gering components. We find the probabilities of reach- 
ing a finite component from degree k a and kb nodes via 
Eq. COT 



and 



Jlk a _i(l;6i) = 



1 — r 1 + r r 



2 2 

1 + r 1 — r " 



^fc.-i(i;6i) 
*k„-i(l;k) 



> (29) 
(30) 



Only the first equation need be solved and is in general 
a k a — 1 degree polynomial in Ffe a _i(l; fei ). We now 
specialize our results for the case fc a = 3. We have a 
quadratic equation from which we obtain 



1 for r < 0, 
fe£ forr>0. 

1+r — 



(31) 



and 



F fc6 _ 1 (l;fc 1 ) = ^ i£. 



l-r 
1+r 



for r < 0, 
for r > 0. 



(32) 



Note that in finding ^(lj&i), we have had to make 
a choice between the two roots of the quadratic given 
in Eq. (|31|) . First, we have only one realistic solution in 
the interval [0,1] for r < 0, namely i*2(l;&i) = 1 (the 
other solution, (1 — r)/(l + r), exceeds 1 for r < 0). 
Second, in considering the limit r = 1 where degree 3 
nodes are connected only to other degree 3 nodes form- 
ing a giant component, we require that the probability of 
reaching a finite component must drop to 0. We there- 
fore see that (1 — r)/(l + r) must be the correct choice 
since then F 2 (l;&i) = at r = 1. We further observe 
that the two roots cross when r = 0, and for r > 0, the 
root (1 — r)/(l + r) is less than the root at unity. Thus, 
by an expectation of continuity, (1 — r)/(l + r) must be 
the solution for all of r > 0. 

We next determine the size of the vulnerable com- 
ponent and triggering probabilities as functions of r 
using Eqs. l[TB)). (fI7|). and (JTHJ). For r < 0, we have 
Svuin = Strig = 0, and for r > 0, 



•S y 



o(3) _i _ 
u trig A 



°trig 1 



1 



(33) 



(34) 
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trig 



1 -r 



fc b + 3 V 1 + r 



1 + r 



(35) 



k b + 3 V 2 



l-r 
1 + r 



As shown in Fig. H^a), we obtain excellent agreement 
between equations and the output of simulations for 
ki, = 4 on networks with N = 0.98 x 10 5 nodes and 
t = 1/3. We used this particular value of N since 
it is the nearest number to 10 5 that is divisible by 7 
[required by Eqs. (|24j) and ([2"5])] and, for convenience, 
also a multiple of 100. As was observed for the origi- 
nal threshold model on random networks [l3j . we find 
that well-connected networks near the transition to a 
non-percolating largest vulnerable component are of a 
'robust-yet-fragile' nature (26[. We see in Fig. HJa) that 
if assortativity r is just slightly positive, and if the initial 
seed is in the small triggering component present, then 
spreading reaches the entire network. These otherwise 
highly resilient networks have an Achilles heel that leads 
to a complete transition in individual node states. Fur- 
thermore, finite size effects may be significant: for the 
networks with N = 9800 nodes, our simulations show 
that global spreading is possible for r as low as —0.07. 

While our expressions for Svuin and Strig only depend 
coarsely on b\ , the final extent of a global spreading event 
S is more sensitive to changes in node response functions. 
Taking as an example one of our bimodal networks with 
k a = 3 and kb = 10, we see any value of r in ( jq, i] is 
equivalent to r = i, as far as Svuin and S tr i g are con- 
cerned, since only degree 3 nodes are vulnerable. How- 
ever, whether or not degree 10 nodes are vulnerable to 
i neighbors switching to state <j\ depends on r meeting 
and exceeding For the case k a = 3 and kb = 4, the 
results we present here apply for all r in (j, g]. 

For the final size of global spreading events, S, manip- 
ulations of Eq. (|2"3"1) for k a = 3 and kb = 4 show we need 
to solve the following for 63.00: 



73,00 



where 



1^^2-^)03^ + 1^(3-^,00)^ (36) 



74,00 



1 -r 



7 3,oo 



(l-3r + 2r# 3 ,oo) (37) 



for < r < 1. Knowing that 63,00 = 1 is a solution 
of Eq. (|36j) means we can reduce the problem to solving 
for the roots of a quartic, for which we could obtain a 
complete analytic solution. For our purposes, numerical 
simulation is sufficient. 

Our simulations and numerical analyses indicate the 
presence of another kind of phase transition in S. At the 
upper limit of r = 1, the network is separated into two 
giant, fully-connected components, one comprising solely 



7 



1 1 1 1 r 

1 ■ ♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦ 

: s 




0.2 0.4 0.6 0.8 1 
assortavity r 



FIG. 1: (a) Three key spreading measures as a function of 
assortativity r for networks with Pk = jSks + f <5fc4. Solid 
lines indicate theoretical curves and symbols represent mea- 
surements from simulations. The three curves are (1) the 
fractional size of the largest vulnerable component, 5 V uin, 
(Eq. (|33[) . circles); (2) the fractional size of the largest trig- 
gering component, Strig, (Eq. (|35|1 . squares); and (3) the 
fractional size of global spreading events S (Eq. (|22p . dia- 
monds). The upper discontinuous phase transition in S occurs 
at r = r^ ppcr = 0.77485 ± 0.00001. The data shown were 
obtained from networks with iV=0.98 x 10 5 nodes (correlations 
were generated using a shuffling algorithm described in [25l |) 
with initial seeds placed at each node of 10 sample networks 
for a total of 9.8x10 samples per each value of r. (b) Solu- 
tions for #3 j00 and #4,00, the probability that edges leading to 
degree 3 and 4 nodes will be from nodes in state 01 in the 
long time limit (see Eqs. (|36)) and (|37)) .') Solid and dashed 
lines indicate stable and unstable ones respectively. When r 
reaches r" ppor , a repeated solution appears for (#3,00, 84,0a), 
distinct from (1, 1), giving rise to the discontinuous upper 
phase transition in S. 



k = 4 node is connected to two (or more) degree k = 3 
nodes in the giant vulnerable component increases. The 
expected size </>oo of a global spreading event grows grad- 
ually until it reaches a discontinuous phase transition and 
</>oo jumps to 1. Through numerical analysis of Eq. (|36p . 
we see that as r increases, a repeated root #3,00 appears 
when r = r^ ppor ~ 0.775. The roots become real and sep- 
arate with both of them limiting to 1 as r — ► 1. Fig. [Ub) 
shows how both #3,00 and #4 j00 behave as a function of 
r. Note that the discontinuity in the final size of a global 
spreading event is not reflected in the curves for S t rig or 
S'vuin, which both change smoothly for < r < 1. 



IV. CONCLUDING REMARKS 

We have developed a scries of theoretical expressions 
for a general class of spreading models on random net- 
works with tunable degree assortativity r. Our main 
results are the derivation of the fractional size of the 
giant vulnerable and triggering components as a func- 
tion of degree-degree distribution ejk- When we allow all 
nodes to be susceptible with uniform probability b, our 
results reduce to the standard disease-like spreading case, 
and when we further set b = 1, wc obtain the conditions 
for the existence of a giant component. 

When we set r = 0, we also retrieve the condition for 
the existence of a giant vulnerable component (Eq. (fT2|) ) 
along with its size (Eq. (|16[) ) in pure random networks 
with arbitrary degree distributions (l3j . For the trigger- 
ing component, aside from our work for general r, wc have 
also added to known results for the r = case. We have 
obtained the probability that an exogenously activated 
node of degree k may trigger a global spreading event 
(Eq. I|18p) and hence the probability that a randomly 
chosen node may do the same ( Eq. (fl~7]) ). Our work com- 
plements the results of Gleeson [2(J which determines the 
final size of a global spreading event (Eqs. (f2"2"|) and l|23p). 
We note that a basic description of many kinds of spread- 
ing from a single initial seed must separately report (1) 
the probability of a global spreading event Ptrig, and (2) 
the expected final size S. The distribution of final sizes 
is often bimodal and an overall expected size conflates 
Ptrig and S in a potentially misleading way, since no size 
other than and S are in fact possible. Finally, the sim- 
ple family of random networks we have considered here 
with Pfc = |<5fc3 + |<5fc4 have allowed us to demonstrate, 
for one example, the validity of our theoretical results for 

^vuln 

(r) and 5 trig (r). 



k = 3 degree nodes and the other k = 4 degree nodes (the 
giant component is of fractional size 1 for all other values 
of r). Spreading therefore occurs only in the k = 3 degree 
node component when r = 1, and, as shown in Fig. [Ha), 
the three quantities S, SVuin, and 5t r i g all equal 4/7 for 
complete assortativity. 

As we decrease r however, the possibility that a degree 
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APPENDIX: CALCULATIONS REGARDING 
COLLAPSE OF GENERAL GLOBAL 
SPREADING CONDITION FOR 
UNCORRELATED NETWORKS 

In the main text, we noted that the general condition 



for global spreading, that 



= (Eq. JT5J), collaps- 



es to the appropriate condition for uncorrelated networks 
when we set ejk = qjq k (Eq. (US]))- We demonstrate this 
in two distinct ways. First, we take a direct route by 
setting ejk = qjqk in the definition of A E £ : 



SjkQk - kb k+ i A qjq k 
(Sjk—qjkbk+i,i)qk- (38) 



The qk that has been factored out stands as a multi- 
ple for the kth column, and therefore, by multilincari- 
ty of determinants (2?| . contributes to the determinant 
of A E ^ as a simple overall multiplicative factor. In 
finding |A E gJ, we can therefore focus on the matrix 
[fijk ~ Qjkbk+i i], We find the determinant of this matrix 
by finding its eigenvalues, which are the eigenvalues of 
[— qjkbk+\ i] incremented by 1 due to the addition of the 
identity matrix. Since [— qjkbk+i,i] is an outer product 
of the two vectors [—qj] and [kb k+ i t i], we have a rank 
one matrix which therefore has a sole non-zero eigen- 
value corresponding to an eigenvector in the direction 
of [— qj]. We find the non-zero eigenvalue by explicit- 
ly applying the matrix [—qjkbk+i,i] to [—(ft], giving us 
Ai = — yi%%i kbk+i.iQk- Shifting the summation index, 
we have Ai = — Sfe^=i(^ — an d A, = for 

% > 2. We now add 1 to all eigenvalues to obtain 
Ai = 1 - EZi( k - l)&fc,ifffe-i and A< = 1 for i > 2. 
The determinant of A E ^ is given by the product of the 
A^ along with the product of the q k elements we initially 
factored out: 




l-J2(k-l)b k ,i 



qk-i 



(39) 



k=l 



Since the qk are not equal to 0, the condition that 
A E £ = for uncorrelated networks depends on the 



second term: = 1 — 2fc=i b k ,i{k — \)q k -\. Substituting 
qk = kp k /k avg gives us the desired condition of Eq. (|13[) . 

Secondly, we can also show that F^(l;&i) is indepen- 
dent of k, meaning that the expected size of a finite sub- 
component found at the other end of an edge connected 
to a node of degree k + 1 does not depend on the latter's 
degree. This will also lead us to the same condition for 
global spreading we found above. To do so, we return to 
Eq. ([6]) and set ejk = q 3 qk'- 



^2 ( s jkQk - fc6fe+i,ig J gfc)-Ffc(l;&i) = ^qjqkbk+i, 



k=0 



k=0 



(40) 

Breaking the left hand side into two pieces, carrying out 
the first summation, and then dividing through by qj, we 
obtain 



F'(l; h) - V kb k+1 ^q k F' k {\- bi) = V q k b k+hl . (41) 



k=0 



fe=0 



Upon rearranging, we can now express F!j(\\b\) as a 
function of terms not involving j (even if they include 
summations involving i^(l;6i)), thereby demonstrating 
independence: 



E 

k=0 



kbk+^xqkF^l-h) +J2lkb k +i,i- (42) 



k=0 



Setting F k (l;bi) = F'(l;bi) for all k, the above is now 
solvable, and we find 



F'(l;h) = 



Sfclo qkbk+ij- 

1 - J2kLo kbk+i,iqk' 



(43) 



Finally, we sec that setting the denominator to recovers 
the condition for global spreading in uncorrelated net- 
works. 
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